More Words and Bigger Pictures

نویسنده

  • David A. Forsyth
چکیده

Object recognition is a little like translation: a picture (text in a source language) goes in, and a description (text in a target language) comes out. I will use this analogy, which has proven fertile, to describe recent progress in object recognition. We have very good methods to spot some objects in images, but extending these methods to produce descriptions of images remains very difficult. The description might come in the form of a set of words, indicating objects, and boxes or regions spanned by the object. This representation is difficult to work with, because some objects seem to be much more important than others, and because objects interact. An alternative is a sentence or a paragraph describing the picture, and recent work indicates how one might generate rich structures like this. Furthermore, recent work suggests that it is easier and more effective to generate descriptions of images in terms of chunks of meaning (”person on a horse”) rather than just objects (”person”; ”horse”). Finally, if the picture contains objects that are unfamiliar, then we need to generate useful descriptions that will make it possible to interact with them, even though we don’t know what they are. About the Speaker David Forsyth is currently a full professor at U. Illinois at Urbana-Champaign, where he moved from U.C Berkeley, where he was also full professor. He has published over 130 papers on computer vision, computer graphics and machine learning. He has served as program chair and as general chair for various international conferences on computer vision. He received an IEEE technical achievement award for 2005 for his research and became an IEEE fellow in 2009. His textbook, ”Computer Vision: A Modern Approach” (joint with J. Ponce and published by Prentice Hall) is widely adopted as a course text. A second edition appeared in 2011. He was named editor in chief of IEEE TPAMI for a term starting in Jan 2013.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Korean deaf adolescents' awareness of thematic and taxonomic relations among ordinary concepts represented by pictures and written words.

Individuals' relative awareness of thematic and taxonomic relations is influenced by factors such as language and background knowledge. Relatively weak in Korean language skills and also having relatively limited social opportunities, Korean deaf adolescents might be different from hearing adolescents in how they make decisions in taxonomically and thematically associated entities represented b...

متن کامل

The properties of retrieval cues constrain the picture superiority effect.

In three experiments, we examined why pictures are remembered better than words on explicit memory tests like recall and recognition, whereas words produce more priming than pictures on some implicit tests, such as word-fragment and word-stem completion (e.g., completing -l-ph-nt or ele----- as elephant). One possibility is that pictures are always more accessible than words if subjects are giv...

متن کامل

Neural correlates of the episodic encoding of pictures and words.

A striking characteristic of human memory is that pictures are remembered better than words. We examined the neural correlates of memory for pictures and words in the context of episodic memory encoding to determine material-specific differences in brain activity patterns. To do this, we used positron emission tomography to map the brain regions active during encoding of words and pictures of o...

متن کامل

Neural processing of emotional pictures and words: a comparison of young and older adults.

Recent findings have revealed age-related changes in neural recruitment during the processing of emotional information. The present study examined whether these age-related changes would be more pronounced for words, thought to be processed in a controlled manner versus relatively automatically processed pictures. Compared to young adults, older adults showed less amygdala activation, and more ...

متن کامل

Selective Activation Around the Left Occipito-Temporal Sulcus for Words Relative to Pictures: Individual Variability or False Positives?

We used high-resolution fMRI to investigate claims that learning to read results in greater left occipito-temporal (OT) activation for written words relative to pictures of objects. In the first experiment, 9/16 subjects performing a one-back task showed activation in > or =1 left OT voxel for words relative to pictures (P < 0.05 uncorrected). In a second experiment, another 9/15 subjects perfo...

متن کامل

Differences in word associations to pictures and words.

Normal subjects were asked to produce the "first word that comes to mind" in response to pictures or words that differed with respect to manipulability and animacy. In separate analyses across subjects and items, normal subjects produced a significantly higher proportion of action words (that is, verbs) to pictures as compared to words, to manipulable as compared to non-manipulable stimuli and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013